Compactified Horizontal Visibility Graph for the Language Network

نویسندگان

  • D. V. Lande
  • A. A. Snarskii
چکیده

A compactified horizontal visibility graph for the language network is proposed. It was found that the networks constructed in such way are scale free, and have a property that among the nodes with largest degrees there are words that determine not only a text structure communication, but also its informational structure. Construction of networks with text elements, words, phrases or fragments of natural language as nodes in some cases allows to detect the structural elements of the text critical for its connected structure and find informationally significant elements, as well as words that are secondary for understanding of the text. Such networks may also be used to identify unconventional text components, such as collocations, supra-phrasal units [1], as well as for finding similar fragments in different texts [2]. There is a multitude of approaches to constructing networks from the texts (so-called language networks) and different ways of interpreting nodes and links, which causes, accordingly, different representation of such networks. Nodes are connected if corresponding words are either adjacent in the text [3, 4], or are in a single sentence [5], or are syntactically [6, 7] or semantically [8, 9] connected. At the intersection of digital signal processing (DSP) theory and complex network theory there are several ways of constructing networks from the time series, among those are visibility graph construction methods (see survey [10]), namely the horizontal visibility graph (HVG) [11,12]. Based on these approaches, networks can also be constructed from texts in which numeric values are assigned in some manner to each word or phrase. The examples of functions assigning a number to a word are: ordinal number of a unique word in a text, length of the word, " weight " of the word in a text, e.g., generally accepted TFIDF metric (canonically, a product of the term frequency in a text fragment and a

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multiplex visibility graphs to investigate recurrent neural network dynamics

A recurrent neural network (RNN) is a universal approximator of dynamical systems, whose performance often depends on sensitive hyperparameters. Tuning them properly may be difficult and, typically, based on a trial-and-error approach. In this work, we adopt a graph-based framework to interpret and characterize internal dynamics of a class of RNNs called echo state networks (ESNs). We design pr...

متن کامل

Epileptic seizure detection based on The Limited Penetrable visibility graph algorithm and graph properties

Introduction: Epileptic seizure detection is a key step for both researchers and epilepsy specialists for epilepsy assessment due to the non-stationariness and chaos in the electroencephalogram (EEG) signals. Current research is directed toward the development of an efficient method for epilepsy or seizure detection based the limited penetrable visibility graph (LPVG) algorith...

متن کامل

Analysis of the Effect of Wind Speed and Soil Moisture on Horizontal Visibility Variations Caused by Dust Event in Arid Regions (Study Region: Southeast of Iran)

Introduction: Increasing or reducing the production of suspended particles depends on the important climatic and terrestrial characteristics of a region. One of the most important climatic factors affecting soil erosion is wind speed, so if the surface winds speed in a region exceeds the threshold of erosion, soil susceptibility to erosion and dust production increases. In contrast, the most im...

متن کامل

Analysis of The climatic changes in Kermanshah province based on the five criteria of precipitation, temperature, wind, weather phenomena, and horizontal visibility

Score for exposure factor is the sum of scores for the five criteria of precipitation, temperature, wind, weather phenomena, and horizontal visibility. Hourly data on precipitation, temperature, relative humidity, weather condition codes, and horizontal visibility obtained at 23 synoptic stations in the Province, and from those located in an area within a radius of 100 kilometers from the borde...

متن کامل

Mapping Time Series to Networks: a Brief Overview of Visibility Algorithms

In the last years a new approach for making time series analysis has appeared. This new approach considers the mapping of time series to networks, in order to characterize the structure of time series (and therefore the dynamics that generated the series) via characterization of the associated network. It makes use of several metrics recently developed in the so called Complex Network theory, a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1302.4619  شماره 

صفحات  -

تاریخ انتشار 2013